[Model]enable glm4-9b #291

intellinjun · 2024-06-13T07:15:12Z

Type of Change

feature or bug fix or documentation or others
API changed or not
not

Description

Enable glm-4-9b
detail description
q4j32cint8 result

Signed-off-by: intellinjun <[email protected]>

a32543254

LGTM

neural_speed/__init__.py

neural_speed/models/chatglm/chatglm2.h

zhentaoyu · 2024-06-13T07:31:52Z

please update its Q4 lambada_openai acc result when your PR ready since GLM has some special tokens

neural_speed/__init__.py

neural_speed/convert/convert_chatglm.py

Signed-off-by: intellinjun <[email protected]>

intellinjun · 2024-06-13T08:07:18Z

https://inteltf-jenk.sh.intel.com/job/neural_speed_extension/173/

Signed-off-by: intellinjun <[email protected]>

enable glm4-9b

723d12a

Signed-off-by: intellinjun <[email protected]>

intellinjun requested review from zhentaoyu, Zhenzhong1 and a32543254 and removed request for zhentaoyu June 13, 2024 07:15

a32543254 approved these changes Jun 13, 2024

View reviewed changes

a32543254 reviewed Jun 13, 2024

View reviewed changes

neural_speed/__init__.py Outdated Show resolved Hide resolved

a32543254 reviewed Jun 13, 2024

View reviewed changes

neural_speed/models/chatglm/chatglm2.h Show resolved Hide resolved

intellinjun added 2 commits June 13, 2024 15:20

Update __init__.py

dc4af54

Update __init__.py

17d0951

zhentaoyu added cpu review_complexity:low extension test labels Jun 13, 2024

zhentaoyu reviewed Jun 13, 2024

View reviewed changes

neural_speed/__init__.py Show resolved Hide resolved

neural_speed/convert/convert_chatglm.py Outdated Show resolved Hide resolved

intellinjun added 2 commits June 13, 2024 15:51

add glm4 extension test

987b050

Signed-off-by: intellinjun <[email protected]>

update huggingface.py

88b0f86

Signed-off-by: intellinjun <[email protected]>

intellinjun and others added 4 commits June 13, 2024 16:41

Update huggingface.py

8eb9332

update huggingface.py

020bdd9

Signed-off-by: intellinjun <[email protected]>

Merge branch 'glm4' of https://github.com/intel/neural-speed into glm4

7fd5577

update huggingface

05d3715

Signed-off-by: intellinjun <[email protected]>

a32543254 merged commit ea20cc2 into main Jun 14, 2024
14 of 15 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Model]enable glm4-9b #291

[Model]enable glm4-9b #291

intellinjun commented Jun 13, 2024 •

edited

Loading

a32543254 left a comment

zhentaoyu commented Jun 13, 2024

intellinjun commented Jun 13, 2024

[Model]enable glm4-9b #291

[Model]enable glm4-9b #291

Conversation

intellinjun commented Jun 13, 2024 • edited Loading

Type of Change

Description

a32543254 left a comment

Choose a reason for hiding this comment

zhentaoyu commented Jun 13, 2024

intellinjun commented Jun 13, 2024

intellinjun commented Jun 13, 2024 •

edited

Loading